simranjit singh
An LLM-Tool Compiler for Fused Parallel Function Calling
Singh, Simranjit, Karatzas, Andreas, Fore, Michael, Anagnostopoulos, Iraklis, Stamoulis, Dimitrios
State-of-the-art sequential reasoning in Large Language Models (LLMs) has expanded the capabilities of Copilots beyond conversational tasks to complex function calling, managing thousands of API calls. However, the tendency of compositional prompting to segment tasks into multiple steps, each requiring a round-trip to the GPT APIs, leads to increased system latency and costs. Although recent advancements in parallel function calling have improved tool execution per API call, they may necessitate more detailed in-context instructions and task breakdown at the prompt level, resulting in higher engineering and production costs. Inspired by the hardware design principles of multiply-add (MAD) operations, which fuse multiple arithmetic operations into a single task from the compiler's perspective, we propose LLM-Tool Compiler, which selectively fuses similar types of tool operations under a single function at runtime, presenting them as a unified task to the LLM. This selective fusion inherently enhances parallelization and efficiency. Benchmarked on a large-scale Copilot platform, LLM-Tool Compiler achieves up to four times more parallel calls than existing methods, reducing token costs and latency by up to 40% and 12%, respectively.
- North America > United States > Illinois > Jackson County > Carbondale (0.04)
- Europe (0.04)
- North America > United States > Washington > King County > Redmond (0.04)
- (3 more...)
GeckOpt: LLM System Efficiency via Intent-Based Tool Selection
Fore, Michael, Singh, Simranjit, Stamoulis, Dimitrios
In this preliminary study, we investigate a GPT-driven intent-based reasoning approach to streamline tool selection for large language models (LLMs) aimed at system efficiency. By identifying the intent behind user prompts at runtime, we narrow down the API toolset required for task execution, reducing token consumption by up to 24.6\%. Early results on a real-world, massively parallel Copilot platform with over 100 GPT-4-Turbo nodes show cost reductions and potential towards improving LLM-based system efficiency.
- North America > United States > Florida > Pinellas County > Clearwater (0.06)
- North America > United States > Washington > King County > Redmond (0.05)
- North America > United States > Virginia > Fairfax County > Reston (0.05)
- (2 more...)
Evaluating Tool-Augmented Agents in Remote Sensing Platforms
Singh, Simranjit, Fore, Michael, Stamoulis, Dimitrios
Tool-augmented Large Language Models (LLMs) have shown impressive capabilities in remote sensing (RS) applications. However, existing benchmarks assume question-answering input templates over predefined image-text data pairs. These standalone instructions neglect the intricacies of realistic user-grounded tasks. Consider a geospatial analyst: they zoom in a map area, they draw a region over which to collect satellite imagery, and they succinctly ask "Detect all objects here". Where is `here`, if it is not explicitly hardcoded in the image-text template, but instead is implied by the system state, e.g., the live map positioning? To bridge this gap, we present GeoLLM-QA, a benchmark designed to capture long sequences of verbal, visual, and click-based actions on a real UI platform. Through in-depth evaluation of state-of-the-art LLMs over a diverse set of 1,000 tasks, we offer insights towards stronger agents for RS applications.
- North America > United States > Washington > King County > Redmond (0.04)
- North America > Mexico > Mexico City > Mexico City (0.04)
- North America > Barbados (0.04)
- Europe > Italy > Marche > Ancona Province > Ancona (0.04)